Better errors for schema merging #607

tormeh · 2024-06-10T16:29:48Z

This is a follow-up to #521. I finally got time to look at it all. I could polish it further without feedback, but at this point the boilerplate is mostly done and I'm starting to encounter design questions. So I thought it best to submit it for review so I don't fly blind.

ahl

What are the design questions you've encountered?
What would you think about a single Error type rather than several?

As I recall, the goal here is to improve the debugging story when understanding a merge failure. It looks like the only output that would change is the new log message in the "Subschemas with other stuff" case. Is that right?

typify-impl/src/convert.rs

typify-impl/src/merge.rs

ahl · 2024-06-12T22:06:56Z

typify-impl/src/merge.rs

+    #[error("Error when trying to merge the two schema objects {a:?} and {b:?}: {source}")]
+    ObjectSchemaMerge {
+        source: Box<ObjectSchemaMergeError>,
+        a: SchemaObject,
+        b: SchemaObject,


It seems sort of heavy to clone these SchemaObjects. Will the caller not already have access to them?

Yes, but how will you know which two SchemaObjects couldn't be merged if you don't have them in the error?

These clones only happen when there's an error. How big are these structs typically?

While we use a Result::Err to indicate that the merge was not viable, this isn't an unexpected or aberrant condition. It doesn't lead to a failure to generate code from a schema. It's an expected and typical condition.
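To make the point about `Err` as a routine outcome concrete, here is a minimal sketch. `Schema`, `Unmergeable`, `try_merge`, and `merge_or_keep_both` are hypothetical stand-ins invented for illustration, not typify's types or functions, and the fallback shown is an assumption. The idea is only that when the failure branch is taken often, whatever the error carries (for example clones of both inputs) is paid for on a common path.

```rust
// Hypothetical stand-in for a schema; typify's real SchemaObject is far richer.
#[derive(Debug, Clone, PartialEq)]
struct Schema {
    ty: &'static str,
}

// A lightweight, data-free error meaning "these two cannot be merged".
#[derive(Debug, PartialEq)]
struct Unmergeable;

// Hypothetical merge: succeeds only when both schemas have the same type.
fn try_merge(a: &Schema, b: &Schema) -> Result<Schema, Unmergeable> {
    if a.ty == b.ty {
        Ok(a.clone())
    } else {
        Err(Unmergeable)
    }
}

// The caller treats Err as a normal outcome and takes a fallback path
// (here: keep both schemas) instead of aborting.
fn merge_or_keep_both(a: &Schema, b: &Schema) -> Vec<Schema> {
    match try_merge(a, b) {
        Ok(merged) => vec![merged],
        Err(Unmergeable) => vec![a.clone(), b.clone()],
    }
}
```

In this shape the caller already holds `a` and `b`, so it can log them itself on the `Err` branch without the error owning copies.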

typify-impl/src/merge.rs

tormeh · 2024-06-18T14:01:41Z

> What are the design questions you've encountered?

Mostly how much context to include. In particular, calls to try_merge_with_subschemas have a mutable SchemaObject. If you want to know what this value was when the function is called you have to clone it before the call, so that even if there is no error you're still doing clones for error handling purposes. I opted against this, but I imagine opinions differ on the right thing to do here.
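The trade-off described here can be sketched roughly as follows. All names (`Schema`, `MergeError`, `merge_in_place`, and the two wrappers) are hypothetical and stand in for the real `SchemaObject` and `try_merge_with_subschemas`; this is an illustration of the two options under those assumptions, not the PR's code.

```rust
// Hypothetical stand-in for a schema that gets mutated during a merge.
#[derive(Debug, Clone, PartialEq)]
struct Schema {
    required: Vec<String>,
}

#[derive(Debug)]
struct MergeError {
    // Snapshot of the input as it was when the merge was attempted, if any.
    before: Option<Schema>,
}

// Hypothetical in-place merge: fails when a conflicting name shows up,
// possibly after `target` has already been partially modified.
fn merge_in_place(target: &mut Schema, extra: &[&str]) -> Result<(), ()> {
    for name in extra {
        if *name == "conflict" {
            return Err(());
        }
        target.required.push(name.to_string());
    }
    Ok(())
}

// Option A: snapshot up front. The clone is paid on every call, even on success.
fn merge_with_snapshot(target: &mut Schema, extra: &[&str]) -> Result<(), MergeError> {
    let before = target.clone();
    merge_in_place(target, extra).map_err(|()| MergeError { before: Some(before) })
}

// Option B: no snapshot. Nothing is cloned, but the error cannot report
// what `target` looked like before the call.
fn merge_without_snapshot(target: &mut Schema, extra: &[&str]) -> Result<(), MergeError> {
    merge_in_place(target, extra).map_err(|()| MergeError { before: None })
}
```

Option B corresponds to the choice made in this PR: no clone on the success path, at the price of less context in the error.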

> What would you think about a single Error type rather than several?

Absolutely a valid choice, although a bit annoying to change now 😅. I chose this approach because it provides the most structure, but it does require a bit of maintenance whenever something is changed... On the other side of this spectrum is stuff like Anyhow, where the error type is essentially a string. I think both have their own charm. A library should IMO never expose stringly errors like Anyhow to applications (because they can't be matched on), but these errors are never shown to the caller, so it's not that important, I guess. My intention with how I've done it is to provide almost a stack trace of what went wrong, along with the data that the different layers were called with. This should provide a great debugging experience.
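As a rough illustration of the "almost a stack trace" idea, the sketch below nests three error layers and walks them through the standard `Error::source` chain. The type names (`PropertyConflict`, `ObjectLayerError`, `SchemaLayerError`) and the messages are made up for this example and are not the definitions in this PR; it uses only the standard library rather than `thiserror`.

```rust
use std::error::Error;
use std::fmt;

// Innermost (hypothetical) failure: the actual reason the merge stopped.
#[derive(Debug)]
struct PropertyConflict {
    name: String,
}

impl fmt::Display for PropertyConflict {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        write!(f, "property `{}` has conflicting types", self.name)
    }
}

impl Error for PropertyConflict {}

// Middle layer: wraps the innermost error and exposes it via `source`.
#[derive(Debug)]
struct ObjectLayerError {
    source: PropertyConflict,
}

impl fmt::Display for ObjectLayerError {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        write!(f, "could not merge object schemas")
    }
}

impl Error for ObjectLayerError {
    fn source(&self) -> Option<&(dyn Error + 'static)> {
        Some(&self.source)
    }
}

// Outermost layer, the one a caller would receive.
#[derive(Debug)]
struct SchemaLayerError {
    source: ObjectLayerError,
}

impl fmt::Display for SchemaLayerError {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        write!(f, "could not merge schemas")
    }
}

impl Error for SchemaLayerError {
    fn source(&self) -> Option<&(dyn Error + 'static)> {
        Some(&self.source)
    }
}

// Walk the `source` chain and render one line per layer.
fn render_chain(err: &dyn Error) -> String {
    let mut lines = vec![err.to_string()];
    let mut current = err.source();
    while let Some(cause) = current {
        lines.push(format!("caused by: {}", cause));
        current = cause.source();
    }
    lines.join("\n")
}
```

Each layer costs a type plus two impls, which is the maintenance burden weighed in this thread against the extra structure.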

> As I recall, the goal here is to improve the debugging story when understanding a merge failure. It looks like the only output that would change is the new log message in the "Subschemas with other stuff" case. Is that right?

That's what the code does, yes. There used to be an unwrap where all the errors surfaced by crashing the progenitor. Originally I wanted to print something there, but it seems to be gone now. Would it be interesting to print something somewhere else? Or potentially do something else with the error?

ahl · 2024-06-27T05:29:12Z

I appreciate you taking another run at this. I'm not convinced that this is going to be helpful enough in debugging a merge failure to warrant the ongoing burden of the cacophony of errors and variants.

> > What would you think about a single Error type rather than several?
>
> Absolutely a valid choice, although a bit annoying to change now

Indeed, but it's going to be much more annoying for me to change later.

> This should provide a great debugging experience.

Can you show that? Stepping back: could we start from a specific condition that's currently a challenge to debug (I expect there are many) and focus on what would make it easier to debug? If we want those conditions to become easier to debug and stay that way, we'll want tests that preserve the behavior.
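One way such a test might look, as a sketch only: a single error enum whose rendered message is pinned by a unit test, so the debugging output cannot silently regress. `MergeError`, `try_merge_types`, and the message text are hypothetical and chosen for illustration; they are not part of typify.

```rust
use std::fmt;

// A single (hypothetical) error type with one variant per failure condition.
#[derive(Debug, PartialEq)]
enum MergeError {
    TypeMismatch {
        left: &'static str,
        right: &'static str,
    },
}

impl fmt::Display for MergeError {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        match self {
            MergeError::TypeMismatch { left, right } => write!(
                f,
                "cannot merge a schema of type `{}` with one of type `{}`",
                left, right
            ),
        }
    }
}

impl std::error::Error for MergeError {}

// Hypothetical merge of two type names: identical types merge, others fail.
fn try_merge_types(left: &'static str, right: &'static str) -> Result<&'static str, MergeError> {
    if left == right {
        Ok(left)
    } else {
        Err(MergeError::TypeMismatch { left, right })
    }
}

#[cfg(test)]
mod tests {
    use super::*;

    // Pins the exact message so the debugging output stays stable.
    #[test]
    fn mismatch_message_is_stable() {
        let err = try_merge_types("string", "integer").unwrap_err();
        assert_eq!(
            err.to_string(),
            "cannot merge a schema of type `string` with one of type `integer`"
        );
    }
}
```

Starting from one concrete hard-to-debug schema and asserting on the message it produces would keep the scope small while still guarding the behavior.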

tormeh added 13 commits June 7, 2024 16:50

Add new subchema merge error type

35c1a04

Add NotSchemaMergeError

224bfb9

fixup

afb8b8b

Add SchemaMergeError

59acafd

Add NotSubschemaMergeError

fda93d5

Improve try_merge_with_subschemas errors

58ec269

fixup

926ddf1

More context

5899cac

Add ObjectSchemaMergeError

93055d9

Follow convention

dbd982d

More context

4bbe19a

Nice to have

40bded1

Get rid of clones

fa7ce3c

tormeh changed the title ~~Merge errors slim~~ Better errors for schema merging Jun 10, 2024

ahl reviewed Jun 12, 2024


Address review on error displays

4fb85d0
